feat: add caching if prompt request fails #148
Conversation
Force-pushed 6136af7 to 8774a00
Force-pushed ae555e3 to 5bcdce4
literalai/api/__init__.py
Outdated
sync_api = LiteralAPI(self.api_key, self.url)
cached_prompt = self.prompt_cache.get(id, name, version)
timeout = 1 if cached_prompt else None
You could move the cache logic into get_prompt_helper to avoid duplicating it for the sync/async versions.
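A minimal sketch of how that shared piece might be factored, assuming the helper receives the prompt cache and the lookup keys. The function name and signature here are illustrative, not the SDK's actual get_prompt_helper; only prompt_cache.get(id, name, version) and the 1-second timeout come from the diff above.

```python
from typing import Any, Optional, Tuple


def resolve_prompt_cache(
    prompt_cache: Any,
    id: Optional[str] = None,
    name: Optional[str] = None,
    version: Optional[int] = None,
) -> Tuple[Optional[Any], Optional[int]]:
    """Look up a cached prompt and pick the request timeout accordingly.

    With a cached copy available, a short timeout is acceptable because a
    network failure can fall back to the cache; otherwise keep the default.
    """
    cached_prompt = prompt_cache.get(id, name, version)
    timeout = 1 if cached_prompt else None
    return cached_prompt, timeout
```

Both the sync and async get_prompt paths would call this once and, on a request failure, return cached_prompt when it is set, so the cache lookup and timeout choice live in a single place.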
literalai/api/__init__.py
Outdated
| json={"query": query, "variables": variables}, | ||
| headers=self.headers, | ||
| timeout=10, | ||
| timeout=timeout, |
I don't see the modification for the async version of make_gql_call
It is lines 1519 and 1532 in the same file
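For illustration only, assuming an httpx-based client and simplified signatures (the real signatures likely differ), the point of the change is that both the sync and the async call paths forward the same optional timeout to the HTTP request:

```python
from typing import Optional

import httpx


class GqlClientSketch:
    """Illustrative stand-in for the SDK's API client; names and signatures assumed."""

    def __init__(self, graphql_endpoint: str, headers: dict):
        self.graphql_endpoint = graphql_endpoint
        self.headers = headers

    def make_gql_call(self, query: str, variables: dict, timeout: Optional[float] = None):
        # Sync path: the optional timeout is threaded straight to the HTTP call.
        with httpx.Client() as client:
            response = client.post(
                self.graphql_endpoint,
                json={"query": query, "variables": variables},
                headers=self.headers,
                timeout=timeout,
            )
            response.raise_for_status()
            return response.json()

    async def make_gql_call_async(self, query: str, variables: dict, timeout: Optional[float] = None):
        # Async path: must accept and forward the same optional timeout,
        # otherwise only the sync client benefits from the shorter timeout.
        async with httpx.AsyncClient() as client:
            response = await client.post(
                self.graphql_endpoint,
                json={"query": query, "variables": variables},
                headers=self.headers,
                timeout=timeout,
            )
            response.raise_for_status()
            return response.json()
```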
willydouhard left a comment
Looks promising! I would love to see a test for it.
Force-pushed 2f9628c to 3433ee1
Force-pushed 7ffb0b3 to 896e303
Force-pushed 896e303 to 3730581
After discussing with @clementsirieix, I'm changing the implementation to avoid an overly specific or complex solution. I'm also increasing the timeout to 2 seconds instead of 1.
Force-pushed 0bd9044 to 85c72d1
Force-pushed 2390ad2 to 06c5047
Force-pushed 0d93fd2 to c5faa02
Add Prompt Caching Functionality
Changes
- SharedCache
- Prompt

Implementation Details
- Cache keys are generated in three formats: id, name, tuple(name, version)
- Caching logic is implemented in both the sync and async get_prompt methods
- On each successful get_prompt, the prompt is cached under the keys id, name, and tuple(name, version)
- Added fallback to cached prompts when API calls fail, with warning logs (see the sketch after this list)
- Keep in mind that this cache is cleared each time the application is restarted
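The cache code itself isn't reproduced here, so below is a rough sketch of what such a cache and fallback could look like. The class name SharedCache, the three key formats, and the warning-on-failure fallback come from the list above; the method names, storage layout, and everything else are assumptions.

```python
import logging
from typing import Any, Dict, Optional

logger = logging.getLogger(__name__)


class SharedCache:
    """In-memory prompt cache keyed by id, name, and (name, version)."""

    def __init__(self) -> None:
        self._store: Dict[Any, Any] = {}

    def get(
        self,
        id: Optional[str] = None,
        name: Optional[str] = None,
        version: Optional[int] = None,
    ) -> Optional[Any]:
        # Prefer the most specific key that was provided.
        if id is not None:
            return self._store.get(id)
        if name is not None and version is not None:
            return self._store.get((name, version))
        if name is not None:
            return self._store.get(name)
        return None

    def put(self, prompt: Any) -> None:
        # Index the prompt under all three key formats on every successful fetch.
        self._store[prompt.id] = prompt
        self._store[prompt.name] = prompt
        self._store[(prompt.name, prompt.version)] = prompt


def get_prompt_with_fallback(fetch, cache: SharedCache, **keys) -> Any:
    """Try the API first; on failure, log a warning and fall back to the cache."""
    try:
        prompt = fetch(**keys)
        cache.put(prompt)
        return prompt
    except Exception:
        cached = cache.get(**keys)
        if cached is not None:
            logger.warning("Prompt request failed; returning cached prompt.")
            return cached
        raise
```

Because the store is plain in-memory state, it disappears on restart, which matches the caveat above.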
Test Code
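The test server code isn't captured here, so the following is a hypothetical sketch matching the curl commands below. Only the /prompt route, its query parameters, and should_fail come from those commands; FastAPI, the unreachable failing_client URL, and the assumption that the prompt cache is shared across client instances are illustrative choices of mine.

```python
import os
from typing import Optional

from fastapi import FastAPI
from literalai import LiteralClient

app = FastAPI()
api_key = os.environ.get("LITERAL_API_KEY")

# A normal client, plus one pointing at an unreachable URL to simulate API failures.
# This assumes the prompt cache is shared across client instances.
client = LiteralClient(api_key=api_key)
failing_client = LiteralClient(api_key=api_key, url="http://localhost:9999")


@app.get("/prompt")
def get_prompt(
    id: Optional[str] = None,
    name: Optional[str] = None,
    version: Optional[int] = None,
    should_fail: bool = False,
):
    # should_fail=true routes through the unreachable client, so the request
    # errors and the SDK has to fall back to the prompt cached by an earlier
    # successful call.
    active = failing_client if should_fail else client
    prompt = active.api.get_prompt(id=id, name=name, version=version)
    return {"id": prompt.id, "name": prompt.name, "version": prompt.version}
```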
Test Commands
# Install dependencies
# Run test server

# Test normal flow
curl "http://localhost:8000/prompt?name=example_prompt"

# Test cache with failure
# First call to populate cache
curl "http://localhost:8000/prompt?name=example_prompt"
# Second call with simulated failure
curl "http://localhost:8000/prompt?name=example_prompt&should_fail=true"

# Test with ID
curl "http://localhost:8000/prompt?id=prompt_123"

# Test with name and version
curl "http://localhost:8000/prompt?name=example_prompt&version=1"